Multi-platform, multi-site, microarray-based human tumor classification.
نویسندگان
چکیده
The introduction of gene expression profiling has resulted in the production of rich human data sets with potential for deciphering tumor diagnosis, prognosis, and therapy. Here we demonstrate how artificial neural networks (ANNs) can be applied to two completely different microarray platforms (cDNA and oligonucleotide), or a combination of both, to build tumor classifiers capable of deciphering the identity of most human cancers. First, 78 tumors representing eight different types of histologically similar adenocarcinoma, were evaluated with a 32k cDNA microarray and correctly classified by a cDNA-based ANN, using independent training and test sets, with a mean accuracy of 83%. To expand our approach, oligonucleotide data derived from six independent performance sites, representing 463 tumors and 21 tumor types, were assembled, normalized, and scaled. An oligonucleotide-based ANN, trained on a random fraction of the tumors (n = 343), was 88% accurate in predicting known pathological origin of the remaining fraction of tumors (n = 120) not exposed to the training algorithm. Finally, a mixed-platform classifier using a combination of both cDNA and oligonucleotide microarray data from seven performance sites, normalized and scaled from a large and diverse tumor set (n = 539), produced similar results (85% accuracy) on independent test sets. Further validation of our classifiers was achieved by accurately (84%) predicting the known primary site of origin for an independent set of metastatic lesions (n = 50), resected from brain, lung, and liver, potentially addressing the vexing classification problems imposed by unknown primary cancers. These cDNA- and oligonucleotide-based classifiers provide a first proof of principle that data derived from multiple platforms and performance sites can be exploited to build multi-tissue tumor classifiers.
منابع مشابه
MULTI CLASS BRAIN TUMOR CLASSIFICATION OF MRI IMAGES USING HYBRID STRUCTURE DESCRIPTOR AND FUZZY LOGIC BASED RBF KERNEL SVM
Medical Image segmentation is to partition the image into a set of regions that are visually obvious and consistent with respect to some properties such as gray level, texture or color. Brain tumor classification is an imperative and difficult task in cancer radiotherapy. The objective of this research is to examine the use of pattern classification methods for distinguishing different types of...
متن کاملDevelopment of antibody-based microarray assay for quantitative detection of aflatoxin B1
BACKGROUND: Aflatoxin B1 (AFB1) is a toxic metaboliteproduced by Aspergillus species that contaminates a wide range ofagricultural products. OBJECTIVES: This study was designed todevelop a rapid and highly sensitive immunoassay method inmicroarray format for quantitative detection of AFB1 to evaluatethe potential of microarray platform for high-throughput screening,which can be beneficial in fo...
متن کاملFeature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملSite selection of water storage based on multi-criteria decision analysis
Water loss can be minimized and conserve through constructing small storage dams for various irrigation purposes to support local livelihood. Geographic information system provides powerful techniques for many hydrological modeling and suitable dam site selection. The current study explored potential sites for small storage dams to meet agricultural requirements in district Malakand, Khyber Puk...
متن کاملGIS-Based Multi Criteria Evaluation for Thermal Power Plant Site Selection in Kahnuj County, SE Iran
Choosing a location for a power plant site is a complex task that involves evaluation of multiple factors, which should satisfy a number of economic and environmental requirements. The main aim of this study is to determine the best possible candidates for thermal power plant sites using Multi-criteria Evaluation and Geographic Information System (GIS) in Kahnuj County in the southeast of Iran....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- The American journal of pathology
دوره 164 1 شماره
صفحات -
تاریخ انتشار 2004